SEQ ID NO:1

G21250/C 

LOCUS       G21250       464 bp    DNA     linear   STS 31-MAY-19
DEFINITION  human STS WI-11603, sequence tagged site.
ACCESSION   G21250
VERSION     G21250.1  GI:1341576
KEYWORDS    STS; STS sequence; primer; sequence tagged site.
SOURCE      Homo sapiens STSs derived from sequences in dbEST and the Unigene
            collection.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 464)
  AUTHORS   Hudson,T.
  TITLE     Whitehead Institute/MIT Center for Genome Research; Physically
            Mapped STSs
  JOURNAL   Unpublished (1995)
COMMENT     Contact: Thomas Hudson
            Whitehead Institute/MIT Center for Genome Research
            Whitehead Institute for Biomedical Research
            9 Cambridge Center, Cambridge MA 02142 USA
            Tel: 617 252 1900
            Fax: 617 252 1902
            Email: thudson@genome.wi.mit.edu
OLIGO



Primer A: GGGAGTTGATTATGTTGTAGAATCA
Primer B: AGAAGCGATCAGACCTCTATGC
STS size: 106
PCR Profile:
  Presoak:
  Denaturation:
  Annealing:       56 degrees C
  Polymerization:
  PCR Cycles:      35
  Thermal Cycler:
Protocol:
  Template:        10 ng
  Primer:          each 5 pM
  dNTPs:           each 4 nM
  Taq Polymerase:  0.025 units/ul
  Total Vol:       20 ul
Buffer:
  MgCl2:           1.5 mM
  KCl:             50 mM
  Tris-HCl:        10 mM
  pH:              9.3
Derived from dbEST (GenBank accession R08904).
FEATURES             Location/Qualifiers
     source          1..464
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
                     /map="421.3 cR from top of Chr3 linkage group"
     STS             29..134
     primer_bind     29..53
     primer_bind     complement(113..134)
BASE COUNT      101 a    104 c    120 g    130 t      9 others
ORIGIN
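
The feature coordinates above can be cross-checked against the primer lengths and the stated STS size. The following is a minimal Python sketch, assuming GenBank's 1-based inclusive coordinates; the helper name `span_length` is illustrative and not part of any tool mentioned in the record.

```python
import re

def span_length(location: str) -> int:
    """Length of a simple GenBank location such as '29..134'
    or 'complement(113..134)' (hypothetical helper)."""
    start, end = map(int, re.search(r"(\d+)\.\.(\d+)", location).groups())
    return end - start + 1  # coordinates are 1-based and inclusive

# Primer sequences as given in the COMMENT block of the record.
primer_a = "GGGAGTTGATTATGTTGTAGAATCA"
primer_b = "AGAAGCGATCAGACCTCTATGC"

print(span_length("29..134"))                              # STS span
print(span_length("29..53"), len(primer_a))                # primer A site vs. primer length
print(span_length("complement(113..134)"), len(primer_b))  # primer B site vs. primer length
```

If the record is internally consistent, the STS span equals the stated STS size and each primer_bind span equals the length of its primer.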



Query Match 16.6%; Score 138; DB 11; Length 464;
Best Local Similarity 100.0%; Pred. No. 6.1e-54;
Matches 138; Conservative 0; Mismatches 0; Indels 0; Gaps

Qy    694 CTAGCCAAAGCTGGCACAGAAGAAGCAATCGTGTATTCAGACATAGACCTGAAGAAGCTG 753
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    228 CTAGCCAAAGCTGGCACAGAAGAAGCAATCGTGTATTCAGACATAGACCTGAAGAAGCTG 169

Qy    754 GCTGAAATACGCCAGCAAATCCCCGTTTTTAGACAGAAGCGATCAGACCTCTATGCTGTG 813
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    168 GCTGAAATACGCCAGCAAATCCCCGTTTTTAGACAGAAGCGATCAGACCTCTATGCTGTG 109

Qy    814 GAGATGAAAAAGCCCTAA 831
          ||||||||||||||||||
Db    108 GAGATGAAAAAGCCCTAA 91
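
The alignment reports the database sequence on its complementary strand (Db coordinates run from 228 down to 91), so Primer B should appear in the second alignment row at Db positions matching the `complement(113..134)` feature. The sketch below checks this in Python. It also checks that the reported Query Match is consistent with 138 matched bases over the query, under two assumptions that the listing does not state: that the query is 831 bases long (the last Qy coordinate shown) and that each match scores 1.

```python
# Second alignment row, copied from the listing (Db runs 168 down to 109).
row = "GCTGAAATACGCCAGCAAATCCCCGTTTTTAGACAGAAGCGATCAGACCTCTATGCTGTG"
db_start = 168
primer_b = "AGAAGCGATCAGACCTCTATGC"

offset = row.find(primer_b)          # 0-based offset of the primer within the row
hi = db_start - offset               # Db coordinate of the primer's first base
lo = hi - len(primer_b) + 1          # Db coordinate of the primer's last base
print(lo, hi)

# Assumption: query length 831 and a score of 1 per matched base.
matches, query_length = 138, 831
print(round(100 * matches / query_length, 1))
```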



SEQ ID NO:2, Standard, AA database
Q9NQR4

ID   Q9NQR4      PRELIMINARY;      PRT;   276 AA.
AC   Q9NQR4;
DT   01-OCT-2000 (TrEMBLrel. 15, Created)
DT   01-OCT-2000 (TrEMBLrel. 15, Last sequence update)
DT   01-MAR-2002 (TrEMBLrel. 20, Last annotation update)
DE   NIT protein 2 (CUA002).
GN   NIT2.
OS   Homo sapiens (Human).
OC   Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
OC   Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
OX   NCBI_TaxID=9606;
RN   [1]
RP   SEQUENCE FROM N.A.
RX   PubMed=10959838;
RA   Pace H.C., Hodawadekar S.C., Draganescu A., Huang J., Bieganowski P.,
RA   Pekarsky Y., Croce C.M., Brenner C.;
RT   "Crystal structure of the worm NitFhit Rosetta Stone protein reveals
RT   a Nit tetramer binding two Fhit dimers.";
RL   Curr. Biol. 10:907-917(2000).
RN   [2]
RP   SEQUENCE FROM N.A.
RC   TISSUE=ADRENOCORTICAL TUMOR SECRETED ACTH;
RA   Xu X., Yang Y., Gao G., Xiao H., Chen Z., Han Z.;
RL   Submitted (MAY-2000) to the EMBL/GenBank/DDBJ databases.
DR   EMBL; AF284574; AAF87103.1; -.
DR   EMBL; AF260334; AAG44665.1; -.
DR   InterPro; IPR003010; Ntlse/CNhydtse.
DR   Pfam; PF00795; CN_hydrolase; 1.
SQ   SEQUENCE   276 AA;  30608 MW;  32FA797601A419C6 CRC64;

Query Match 100.0%; Score 1438; DB 4; Length 276;
Best Local Similarity 100.0%; Pred. No. 1.6e-124;
Matches 276; Conservative 0; Mismatches 0; Indels 0; Gaps

Qy      1 MTSFRLALIQLQISSIKSDNVTRACSFIREAATQGAKIVSLPECFNSPYGAKYFPEYAEK 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db      1 MTSFRLALIQLQISSIKSDNVTRACSFIREAATQGAKIVSLPECFNSPYGAKYFPEYAEK 60

Qy     61 IPGESTQKLSEVAKECSIYLIGGSIPEEDAGKLYNTCAVFGPDGTLLAKYRKIHLFDIDV 120
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     61 IPGESTQKLSEVAKECSIYLIGGSIPEEDAGKLYNTCAVFGPDGTLLAKYRKIHLFDIDV 120

Qy    121 PGKITFQESKTLSPGDSFSTFDTPYCRVGLGICYDMRFAELAQIYAQRGCQLLVYPGAFN 180
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    121 PGKITFQESKTLSPGDSFSTFDTPYCRVGLGICYDMRFAELAQIYAQRGCQLLVYPGAFN 180

Qy    181 LTTGPAHWELLQRSRAVDNQVYVATASPARDDKASYVAWGHSTVVNPWGEVLAKAGTEEA 240
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    181 LTTGPAHWELLQRSRAVDNQVYVATASPARDDKASYVAWGHSTVVNPWGEVLAKAGTEEA 240

Qy    241 IVYSDIDLKKLAEIRQQIPVFRQKRSDLYAVEMKKP 276
          ||||||||||||||||||||||||||||||||||||
Db    241 IVYSDIDLKKLAEIRQQIPVFRQKRSDLYAVEMKKP 276



SEQ ID NO:2, Standard
AF284574

LOCUS       AF284574      962 bp    mRNA    linear   PRI 24-JUL-2000
DEFINITION  Homo sapiens Nit protein 2 (NIT2) mRNA, complete cds.
ACCESSION   AF284574
VERSION     AF284574.1  GI:9367115
KEYWORDS
SOURCE      Homo sapiens.
  ORGANISM  Homo sapiens
            Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi;
            Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo.
REFERENCE   1 (bases 1 to 962)
  AUTHORS   Pace,H.C., Hodawadekar,S.C., Draganescu,A., Huang,J.,
            Bieganowski,P., Pekarsky,Y., Croce,C.M. and Brenner,C.
  TITLE     Crystal structure of the worm NitFhit Rosetta Stone protein reveals
            a Nit tetramer binding two Fhit dimers
  JOURNAL   Curr. Biol. 10 (15), 907-917 (2000)
  MEDLINE   20414396
   PUBMED   10959838
REFERENCE   2 (bases 1 to 962)
  AUTHORS   Pace,H.C., Hodawadekar,S.C., Draganescu,A., Huang,J.,
            Bieganowski,P., Pekarsky,Y., Croce,C.M. and Brenner,C.
  TITLE     Direct Submission
  JOURNAL   Submitted (05-JUL-2000) Kimmel Cancer Center, Thomas Jefferson
            University, 233 S 10th Street, Rm. 826, Philadelphia, PA 19107, USA
FEATURES             Location/Qualifiers
     source          1..962
                     /organism="Homo sapiens"
                     /db_xref="taxon:9606"
     gene            1..962
                     /gene="NIT2"
     CDS             22..852
                     /gene="NIT2"
                     /note="Nit2; similar to the Nit domains of Caenorhabditis
                     elegans NitFhit and Drosophila melanogaster NitFhit"
                     /codon_start=1
                     /product="Nit protein 2"
                     /protein_id="AAF87103.1"
                     /db_xref="GI:9367116"
                     /translation="MTSFRLALIQLQISSIKSDNVTRACSFIREAATQGAKIVSLPEC
                     FNSPYGAKYFPEYAEKIPGESTQKLSEVAKECSIYLIGGSIPEEDAGKLYNTCAVFGP
                     DGTLLAKYRKIHLFDIDVPGKITFQESKTLSPGDSFSTFDTPYCRVGLGICYDMRFAE
                     LAQIYAQRGCQLLVYPGAFNLTTGPAHWELLQRSRAVDNQVYVATASPARDDKASYVA
                     WGHSTVVNPWGEVLAKAGTEEAIVYSDIDLKKLAEIRQQIPVFRQKRSDLYAVEMKKP
                     "
BASE COUNT      283 a    208 c    218 g    253 t
ORIGIN
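
The CDS coordinates, the protein length, and the base counts in this record can be checked against one another with simple arithmetic. The sketch below is one way to do it in Python, assuming that the final codon of the CDS is a stop codon that is not counted in the 276-residue translation.

```python
# CDS location 22..852 from the feature table (1-based, inclusive).
cds_start, cds_end = 22, 852
cds_nt = cds_end - cds_start + 1
codons, remainder = divmod(cds_nt, 3)
protein_len = codons - 1  # assumption: the last codon is a stop codon

# BASE COUNT line of the record; the total should equal the 962 bp in LOCUS.
base_count = {"a": 283, "c": 208, "g": 218, "t": 253}

print(cds_nt, remainder, protein_len)
print(sum(base_count.values()))
```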



Alignment Scores:
Pred. No.:              1.37e-137     Length:        962
Score:                  1438.00       Matches:       276
Percent Similarity:     100.00%       Conservative:  0
Best Local Similarity:  100.00%       Mismatches:    0
Query Match:            100.00%       Indels:        0
DB:                     9             Gaps:          0

US-09-769-952-2 (1-276) x AF284574 (1-962)
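
Because the protein query (residues 1-276) is aligned against the translated mRNA, each query residue corresponds to one codon of AF284574, counted from the CDS start at base 22. The following Python sketch shows the coordinate mapping; the function name `codon_span` is illustrative, and the mapping assumes an ungapped alignment, as the scores above report (Indels 0, Gaps 0).

```python
from typing import Tuple

CDS_START = 22  # first base of the CDS in AF284574, from the feature table

def codon_span(residue: int, cds_start: int = CDS_START) -> Tuple[int, int]:
    """Return the 1-based (first, last) nucleotide positions of the codon
    encoding the given 1-based residue, assuming no gaps (hypothetical helper)."""
    first = cds_start + 3 * (residue - 1)
    return first, first + 2

# Residues that begin alignment rows, plus the final residue.
for residue in (1, 21, 61, 261, 276):
    print(residue, codon_span(residue))
```

Under this mapping, residue 1 starts at base 22 and residue 276 ends at base 849, leaving bases 850-852 for the stop codon.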



Qy      1 MetThrSerPheArgLeuAlaLeuIleGlnLeuGlnIleSerSerIleLysSerAspAsn 20
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     22 ATGACCTCTTTCCGCTTGGCCCTCATCCAGCTTCAGATTTCTTCCATCAAATCAGATAAC 81

Qy     21 ValThrArgAlaCysSerPheIleArgGluAlaAlaThrGlnGlyAlaLysIleValSer 40
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db     82 GTCACTCGCGCTTGTAGCTTCATCCGGGAGGCAGCAACGCAAGGAGCCAAAATAGTTTCT 141

Qy     41 LeuProGluCysPheAsnSerProTyrGlyAlaLysTyrPheProGluTyrAlaGluLys 60
          ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Db    142



Qy     61 IleProGlyGluSerThrGlnLysLeuSerGluValAlaLysGluCysSerIleTyrLeu 80
Db    202 ATTCCTGGTGAATCCACACAGAAGCTTTCTGAAGTAGCAAAGGAATGCAGCATATATCTC 261


Qy     81 IleGlyGlySerIleProGluGluAspAlaGlyLysLeuTyrAsnThrCysAlaValPhe 100
Db    262 ATTGGAGGCTCTATCCCTGAAGAGGATGCTGGGAAATTATATAACACCTGTGCTGTGTTT 321


Qy    101 GlyProAspGlyThrLeuLeuAlaLysTyrArgLysIleHisLeuPheAspIleAspVal 120
Db    322 GGGCCTGATGGAACTTTACTAGCAAAGTATAGAAAGATCCATCTGTTTGACATTG... 381


Qy    121 ProGlyLysIleThrPheGlnGluSerLysThrLeuSerProGlyAspSerPheSerThr 140
Db    382 CCTGGAAAAATTACATTTCAAGAATCTAAAACATTGAGTCCGGGTGATAGTTTCTCCACA 441


Qy    141 PheAspThrProTyrCysArgValGlyLeuGlyIleCysTyrAspMetArgPheAlaGlu 160
Db    442 TTTGATACTCCTTACTGCAGAGTGGGTCTGGGCATCTGCTACGACATGCGGTTTGCAGAG 501


Qy    161 LeuAlaGlnIleTyrAlaGlnArgGlyCysGlnLeuLeuValTyrProGlyAlaPheAsn 180
Db    502 CTTGCACAAATCTACGCACAGAGAGGCTGCCAGCTGTTGGTATATCCAGGAGCTTTT... 561


Qy    181 LeuThrThrGlyProAlaHisTrpGluLeuLeuGlnArgSerArgAlaValAspAsnGln 200
Db    562 CTGACCACTGGACCAGCCCATTGGGAGTTACTTCAGCGAAGCCGGGCTGTTGATAATCAG 621


Qy    201 ValTyrValAlaThrAlaSerProAlaArgAspAspLysAlaSerTyrValAlaTrpGly 220
Db    622 GTGTATGTGGCCACAGCCTCTCCTGCCCGGGATGACAAAGCC... 681


Qy    221 HisSerThrValValAsnProTrpGlyGluValLeuAlaLysAlaGlyThrGluGluAla 240
Db    682 CACAGCACCGTGGTGAACCCTTGGGGGGAGGTTCTAGCCAAAGCTGGCACAGAAGAAGCA 741


Qy    241 IleValTyrSerAspIleAspLeuLysLysLeuAlaGluIleArgGlnGlnIleProVal 260
Db    742 ATCGTGTATTCAGACATAGACCTGAAGAAGCTGGCTGAAATACGCCAGCAAATCCCCGTT 801


Qy    261 PheArgGlnLysArgSerAspLeuTyrAlaValGluMetLysLysPro 276
Db    802 TTTAGACAGAAGCGATCAGACCTCTATGCTGTGGAG... 849
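
Each Db row in this alignment is the nucleotide sequence whose translation gives the Qy row above it. The Python sketch below illustrates that relationship for the first row (Qy 1-20, Db 22-81), assuming the standard genetic code. The table is built from the conventional 64-letter amino-acid string in T, C, A, G base order, and the function name `translate3` is illustrative only.

```python
from itertools import product

# Standard genetic code: amino acids for codons TTT, TTC, TTA, TTG, TCT, ...
# (first base varies slowest, third base fastest, base order T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter amino-acid codes ('*' marks a stop codon).
THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "***",
}

def translate3(nt: str) -> str:
    """Translate complete codons of nt into concatenated three-letter codes."""
    usable = len(nt) - len(nt) % 3  # ignore a trailing partial codon
    return "".join(THREE[CODON_TABLE[nt[i:i + 3]]] for i in range(0, usable, 3))

# First alignment row, copied from the listing.
db_row = "ATGACCTCTTTCCGCTTGGCCCTCATCCAGCTTCAGATTTCTTCCATCAAATCAGATAAC"
qy_row = "MetThrSerPheArgLeuAlaLeuIleGlnLeuGlnIleSerSerIleLysSerAspAsn"

print(translate3(db_row) == qy_row)
```

The same check can be applied to any other row whose Db sequence is fully legible.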





